Mining characteristics of epidemiological studies from Medline: a case study in obesity
Abstract
BACKGROUND: The health sciences literature includes a relatively large subset of epidemiological studies that focus on population-level findings, including various determinants, outcomes and correlations. Extracting structured information about those characteristics would support a more complete understanding of diseases, as well as meta-analyses and systematic reviews.

RESULTS: We present an information extraction approach that enables users to identify key characteristics of epidemiological studies from MEDLINE abstracts. It extracts six types of epidemiological characteristics: design of the study, population that has been studied, exposure, outcome, covariates and effect size. We have developed a generic rule-based approach, designed according to semantic patterns observed in text, and tested it in the domain of obesity. Identified exposure, outcome and covariate concepts are clustered into health-related groups of interest. On a manually annotated test corpus of 60 epidemiological abstracts, the system achieved precision, recall and F-score between 79-100%, 80-100% and 82-96% respectively. We report the results of applying the method to a large-scale epidemiological corpus related to obesity.

CONCLUSIONS: The experiments suggest that the proposed approach could identify key epidemiological characteristics associated with a complex clinical problem from related abstracts. When integrated over the literature, the extracted data can be used to provide a more complete picture of epidemiological efforts, and thus support understanding via meta-analysis and systematic reviews.
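The evaluation figures in the abstract are reported as precision, recall and F-score. As a minimal sketch of how such figures are typically computed from annotation counts, the Python snippet below assumes the balanced F-score (F1, the harmonic mean of precision and recall); the abstract does not state which F-score variant was used. The function name and the counts in the usage line are hypothetical and are not taken from the paper.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Compute precision, recall and balanced F-score (F1) from raw counts.

    tp: extracted mentions that match the manual annotation (true positives)
    fp: extracted mentions with no matching annotation (false positives)
    fn: annotated mentions the system missed (false negatives)

    Assumption: the balanced F1 is used; the source abstract does not say
    which F-score variant its figures refer to.
    """
    # Guard against empty denominators so the function never divides by zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts for one characteristic type (illustration only).
p, r, f = precision_recall_f1(tp=8, fp=2, fn=2)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

In an evaluation like the one described, a function of this kind would be applied once per characteristic type (study design, population, exposure, outcome, covariates, effect size), which is why the results are reported as ranges.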
Similar resources
Visfatin: an adipokine that plays a crucial role in increasing the risk of cancer
Obesity is an important public health problem worldwide. Epidemiological studies have demonstrated that obesity is associated with an increased risk of several cancer types. Obesity is also associated with an increase in cancer mortality. The biological mechanisms and the relationship between obesity and cancer are complex and not well understood. Studies on the role of adipose-derived factors in c...
Formation of a deep pit lake: case study of Aguas Claras, Brazil
The paper presents a case study of the ongoing formation of a Brazilian pit lake at a former iron ore mine. The water used to fill the lake comes from rain, groundwater and supplementary pumping from a nearby river. At its final stage, which will be reached around the year 2018, Lake Aguas Claras will have a surface area of 0.67 km² and a depth of 234 m, which will make it...
Epidemiology of Uterine Myomas: A Review
Myomas are the most common benign tumors of the genital organs in women of childbearing age, causing significant morbidity and impairing their quality of life. In our investigation, we have reviewed the epidemiological data related to the development of myomas in order to homogenize the current data. Therefore, a MEDLINE and PubMed search, for the years 1990-2013, was conducted using a ...
Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study
This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...
Epidemiological study designs- Examples of medical sciences
Epidemiology is the study and analysis of the distribution and determinants of health-related conditions or events, including diseases, and the application of this study to the control of diseases and other health problems (1). One of the basic issues in epidemiology, and at the beginning of a research project, is choosing a suitable study design (2). The aim of this study is to brief e...